Prediction of Ligand Binding sites in RNA binding protein Pockets using support vector machines
نویسندگان
چکیده
RNA-binding proteins play a significant role in pattern regulation of gene expression during developmental phases. Therefore in order to facilitate our understanding of organism development there is a continuous need to develop an extensive a priori method for the prediction of RNA-binding protein pockets. We present here a SVM (Support Vector Machine) based approach for successful prediction of these pockets. The method employs two datasets: the protein sequences of the RNA binding protein pockets and the non-RNA binding protein pockets, both of which when combined to form the positive and negative datasets to be fed into the SVM model. Before feeding the data to the SVM, both the datasets were crossed with several steps of sorting, which refined the selection process of obtaining ranked features of these datasets. Analysis was applied on 3 different featured datasets viz FPOCKET, Zernike and shell features. The results suggest that the top 10 features of shell are very important and play a pivotal role in the classification and prediction of ligand binding sites in RNA binding proteins. An accuracy of 89.3% was achieved when evaluated. This study demonstrates that it is possible to predict ligand binding sites in RNA binding protein pockets using its sequence.
منابع مشابه
Identification of RNA-binding sites in artemin based on docking energy landscapes and molecular dynamics simulation
There are questions concerning the functions of artemin, an abundant stress protein found in Artemiaduring embryo development. It has been reported that artemin binds RNA at high temperatures in vitro, suggesting an RNA protective role. In this study, we investigated the possibility of the presence of RNA-bindingsites and their structural properties in artemin, using docking energy ...
متن کاملInvestigation the Mechanism of Interaction between Inhibitor ALISERTIB with Protein Kinase A and B Using Modeling, Docking and Molecular Dynamics Simulation
The high level of conservation in ATP-binding sites of protein kinases increasingly demandsthe quest to find selective inhibitors with little cross reactivity. Kinase kinases are a recently discovered group of Kinases found to be involved in several mitotic events. These proteins represent attractive targets for cancer therapy with several small molecule inhibitors undergoing different ph...
متن کاملIn silico investigation of lactoferrin protein characterizations for the prediction of anti-microbial properties
Lactoferrin (Lf) is an iron-binding multi-functional glycoprotein which has numerous physiological functions such as iron transportation, anti-microbial activity and immune response. In this study, different in silico approaches were exploited to investigate Lf protein properties in a number of mammalian species. Results showed that the iron-binding site, DNA and RNA-binding sites, signal pepti...
متن کاملTargetATPsite: A template-free method for ATP-binding sites prediction with residue evolution image sparse representation and classifier ensemble
Understanding the interactions between proteins and ligands is critical for protein function annotations and drug discovery. We report a new sequence-based template-free predictor (TargetATPsite) to identify the Adenosine-5'-triphosphate (ATP) binding sites with machine-learning approaches. Two steps are implemented in TargetATPsite: binding residues and pockets predictions, respectively. To pr...
متن کاملBindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences
BindN (http://bioinformatics.ksu.edu/bindn/) takes an amino acid sequence as input and predicts potential DNA or RNA-binding residues with support vector machines (SVMs). Protein datasets with known DNA or RNA-binding residues were selected from the Protein Data Bank (PDB), and SVM models were constructed using data instances encoded with three sequence features, including the side chain pK(a) ...
متن کامل